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A SURVEY  OF  STATISTICAL  METHODS  IN  SYSTEMS 
RELIABILITY  USING  BERNOULLI  SAMPLING  OF  COMPONENTS 

Bernard  Harris 

1.  Introduction.  A fundamental  problem  in  the  area  of  statistical  inference 
in  reliability  theory  is  the  statistical  estimation  of  the  reliability  of  a system 
from  experimental  data  collected  on  each  of  the  individual  components.  In 
this  survey,  I will  try  to  describe  the  state  of  the  art  as  I envision  it.  As  I 
will  subsequently  demonstrate,  much  still  remains  to  be  accomplished  in  this 
area,  particularly  in  the  construction  of  statistical  methods  which  are  satis- 
factory when  the  true  system  reliability  is  high. 

The  paper  is  divided  into  sections  as  follows.  In  Section  2 we  de- 
scribe coherent  systems  and  list  some  of  their  basic  properties.  In  Section 
3,  we  discuss  various  measures  of  the  importance  of  individual  components 
in  a system.  The  fourth  section  treats  methods  of  statistical  inference  for 
series  systems,  and  parallel  systems  are  discussed  in  Section  5.  The  sixth 
section  is  devoted  to  arbitrary  coherent  systems  and  the  seventh  section  is 
concerned  with  Bayesian  methods.  A discussion  of  the  use  of  loss  functions 
is  treated  in  Section  8.  Some  concluding  remarks  are  made  in  the  ninth 
section.. 

2,  Coherent  systems.  We  now  introduce  some  definitions  and  notation  which 
will  be  used  throughout  this  survey. 

A system  is  regarded  as  a specified  configui ation  of  components.  To  each 

component  c,  we  associate  the  variable  x , where  x assumes  the  value  0 

c c 
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or  1 . h system  of  k components  is  then  defined  as  a function  <p X % 

X . . . X%,  -*  { 0,1  },  where  each  % = {0, 1}  and  x = x e % ,,  i = 1,  2, . . . , k . 
^ 11  c i 

In  the  customary  language  which  originates  from  the  engineering  ap- 
plications which  have  motivated  the  study  of  reliability  theory, 


and 


Xj  = 1 means  the  component  is  functioning, 
Xj  = 0 means  the  component  has  failed, 


x2, . . . , x]c)  = 


means  the  system  is  functioning 
means  the  system  has  failed. 


The  system  <p  is  said  to  be  a coherent  system  if 

1.  tp(0,  0, . . . , 0)  = 0, 

2.  <p(  1, = 1,  and 


3.  X2>  * * • » xjc)  *s  increasing  in  each  argument. 

We  let  x = (Xj,  x2,. . . , x^)  . Then  we  can  define  the  following  types 
of  coherent  systems. 

_ k 

1.  Series  systems,  <p(x)  = TT  x.  , 

i=l 


>v  . ,-f 

2.  Parallel  systems,  <p(x)  = 1 - TT  (1-x.)  , 


i=l 


~ 4/  ki+i 

3.  Series-parallel  systems,  <p(x)  = TT  1 - TT  (1-x) 
r-1  J=0\  i=k.+l  1 


and  Yj  <}  = k . 


, where  kQ  = 0 


J=0 
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1-1/  ki+l  \ 

, ?(*)  = i - TT  i - IT  x 

j=o\  uyi  7 


, where  kQ  = 0 


4.  Parallel- series  system 
r-1 

and  Yj  k = k . 
j=0  J 

5.  r out  of  k systems,  <p{x)  - TT  x,  TT  <l-x.),  where  A is 

- j ic  A i£  AC  1 J 

any  subset  of  {1,  2,  . . . , k}  with  !a^  I > r,  where  J I A | denotes  the 

cardinality  of  the  set  A,  and  the  sum  is  over  all  such  subsets. 

A series-parallel  system  consists  of  k components  divided  into  dis- 
joint subsets,  each  subset  is  a parallel  system-aod  the  parallel  systems,  re- 
garded as  individual  components, are  combined  into  a series  system. 

The  parallel -series  system  is  analogously  defined  with  each  subset 
being  a series  system  and  the  totality  is  combined  into  a parallel  system. 

An  r out  of  k system  functions  if  and  only  if  at  least  r of  the  k 


components  function. 

There  is  an  extensive  literature  on  the  theory  of  coherent  systems. 
These  ideas  were  first  formulated  in  Z.  W.  Birnbaum,  J.  D.  Esary,  and  S.  C. 
Saunders  [7],  in  which  they  extended  some  ideas  of  E.  F.  Moore  and  C.  E. 
Shannon  [29].  For  many  further  details,  the  reader  is  referred  to  R.  E.  Barlow 
and  F.  Proschan  [2],  [4]  and  James  P.  Lipp  [18]. 

Now  let  X.,  i = 1,  2, . . . , k be  a family  of  mutually  independent 
Bernoulli  random  variables,  and  P{X^  = l}  = p,  = 1-  P{Xi  = O},  i = 1,  2,  . . . , k . 
Then  we  define  the  reliability  function  h(Pj,  p2, . . . , p^)  of  the  system  <p  as 


(1) 


h(p)  = e Mxl,x2,...,xk)}  = P W(Xj,  X^, . . . , Xfc)  a 1} 


where  p = (p  , p , . . . , p ) . By  (1),  we  see  that  h(p)  is  the  probability 

1 b iC 

that  the  system  is  functioning,  which  coincides  with  the  interpretation  of  the 
term  reliability  in  the  customary  engineering  context. 

It  is  easily  seen  that  for  the  series,  parallel,  series-parallel,  paral- 


lel-series and  r out  of  k systems, 


(2) 


h(P)  = v(P)  , 


upon  extending  the  domain  of  cp  to  the  k-fold  product  of  unit  intervals  and  extend- 
ing the  range  oi  <p  to  the  unit  interval.  On  the  set  {0<p.  <i,  i = l,  2,  . .. . , k}, 
h(p)  is  increasing  in  each  p,,  i = 1,  2, . . . , k,  whenever  <p  is  a coherent  system. 
The  techniques  to  be  described  in  this  survey  are  all  concerned  with 

<N/ 

methods  for  statistical  inference  concerning  cp  ■-  h(p)  . 

Before  turning  to  the  problems  of  statistical  inference,  I would  like  to 
call  attention  to  some  concepts  which  I feel  have  potential  utility  in  statistical 
inference  for  systems  reliability,  but  thus  far  appear  not  to  have  been  really 
exploited. 

3.  The  relative  importance  of  individual  components  in  a system.  Z.  W. 

Birnbaum  [6]  defined  two  notions  of  the  importance  of  a component  in  a system. 

The  reliability  importance  of  component  i,  1 < i < k is  defined  as 


(3) 


and  the  structural  importance  of  component  i is  defined  as 


(4) 


V‘>  * oif 


1 

2 


In  R.  E.  Barlow  and  F.  Proschan  [3],  the  notion  of  a fault  tree  was 
employed  to  justify  a different  definition  of  the  importance  of  a particular 
component  in  a system.  In  the  notation  and  terminology  of  this  paper,  this 
notion  of  the  importance  of  the  ith  component  can  be  interpreted  as  the 
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probability  that’  component  i caused  the  failure  of  the  system  given  that 
the  system  has  failed.  In  R.  E.:  Barlow  and  F.  Proschan  [ 3 ],  this  concept 
was  introduced  for  continuous  life  distributions,  rather  than  the  model  em- 
ployed here  in  terms  of  Bernoulli  random  variables.  Thus  for  Barlow  and 
Proschan  the  probability  that  more  than  one  component  caused  the  failure  of 
the  system  is  zero.  However,  for  the  Bernoulli  model  of  this  paper,  such  is 
not  the  case.  Therefore,  some  modifications  of  this  definition  are  needed. 
Hence,  we  would  like  to  propose  the  following  ad  hoc  alternative  in  this  case 
and  define  the  importance  of  component  i by 


151  WmS'm}"  l T l P{y(X)=0,  X = X = X «...  *X  =0} 

1 WI  u;  j=l  1 1<Jj<,..<«  J<k,  1 2 j-1 

t #i,  1 < m < j - 1, 

>n  — ’ 

This  is  equivalent  to  assigning  the  cause  of  failure  to  a single  com- 
ponent at  "random"  when  more  than  one  component  has  failed  and  the  system 
has  failed.  The  result  obtained  using  (5)  will  not  in  general  coincide  with 
the  corresponding  result  that  would  be  obtained  if  specific  lifetime  distribu- 
tions for  the  individual  components  were  assumed. 

For  example,  if  the  system  is  a series  system  with  two  components,  then 
the  component  which  causes  the  failure  is  the  first  component  to  fail.  If  Y 
and  l are  the  waiting  :imes  to  failure  and  F(y)  and  G(z)  are  the  respective 
waiting  time  distributions,  then  the  probability  that  the  first  component  causes 
the  system  failure,  given  that  both  components  have  failed  on  or  before  time  T 
is  given  by 


PfY<  z|Y<  T,Z<  T}  = [F(T)G(T))-i  / F(z)dG(z)  . 

0 

It  is  easily  seen  that  in  general  P{Y>z|Y<T,  Z<T}^^;  however,  if 

F(y)  = G(y),  y > 0 or  if  F{y)  and  G(z)  are  uniform  distributions  on 

[0,  u ],  (0,u  ] respectively  with  T<  min[  u , u ],  then  P{Y<  Z |y<  T,  Z<  T}  = 
y z y z 2 

Still  another  alternative  for  a notion  of  the  importance  of  the  ith 
component  is  suggested  by  the  following  considerations.  In  many  applications, 
the  objective  is  to  produce  a system  of  hign  reliability,  hence  it  is  natural  to 
suppose  that  the  reliability  of  each  component  should  be  high.  In  this  case, 
improvement  in  the  system  reliability  is  affected  principally  by  the  behavior 
of  h(Pj,  p^,  . . . , p^)  as  each  pi  tends  to  unity.  Thus,  a further  suggestion 
is  to  define  the  importance  of  the  ith  component  by 


pr  =pk=1 

where  the  indicated  differentiation  is  to  be  interpreted  as  a left -derivative. 

It  is  hoped  that  these  suggestions  prove  to  be  useful.  As  of  this  date, 
I have  not  yet  explored  their  properties  or  their  applications  to  statistical 
methods  of  assessing  reliability. 

4.  Methods  for  statistical  inference  for  the  reliability  of  series  systems.  To 
simplify  notation,  we  define  Qj  = 1 - Pj  > i = 1,  2, . . . , k . Then  q.  is  the 
probability  that  the  ith  component  is  defective.  Assume  that  N.  Bernoulli 
trials  have  been  made  on  the  ith  component.  Then,  if  is  the  number  of 
successes  observed  on  the  ith  component,  we  have 


(6) 


T * 9h(P) 

i " 9Pi 
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(7) 


k Ni  xi  Nrx< 

pV*i ^-V-TT'x.x*.  V ■ V1'2'  -’"!  ■ 

1 = 1,2,...  k . 

Clearly  (X.,X  , . . . , X ) is  a sufficient  statistic  and  any  statistical  technique 

1 4 In. 

need  employ  only  the  data  set  {(x^,  N^),  (x^,  N^),  . . , , (x^,  N^.)}  .. 

We  first  treat  the  case  of  a series  system.  Then  since 

^ k 

h(p)  = TT  P.  , 

i=l  1 

a natural  choice  as  an  estimator  of  h(p)  is 

k k 

(8)  h(p)  = TT  (x  /N  ) = TT  (P.)  , 

i=l  11  i=l 

which  is  both  the  maximum  likelihood  estimator  and  the  minimum  variance 
unbiased  estimator.  However,  this  estimator  can  have  some  undesirable 
properties.  In  the  case  of  parallel  systems,  its  behavior  as  a point  estimator 
can  be  very  poor  and  we  present  an  example  to  this  effect  in  Section  5.  Many 
writers  have  attempted  to  exploit  (8)  as  a basis  for  determining  a lower  con- 
fidence limit  to  h(p)  . Naturally,  all  confidence  limits  to  be  described 
depend  on  the  confidence  coefficient  1 - a;  the  dependence  on  a will  be 
suppressed  in  the  notat.on  that  will  be  employed. 

We  now  present  a summary  of  various  techniques  that  have  been 
proposed. 

F.  Nishime  [32]  computes  the  lower  confidence  limit  L ^ for  each  p. 
using  the  binomial  distribution..  Let 


-7- 


...  , K 

i* " E X £, 


and  let  L'  be  the  lower  confidence  limit  for  the  binomial  parameter  computed 

k k 

from  the  binomial  distribution  with  N = V N and  x . Then 

i=l  1 i-1  1 


Lh  = (k'/k*  TT  l • 

i=l 

The  rationale  for  the  method  is  by  no  means  obvious.  If  the  are  very 

disparate,  this  can  give  unreasonable  answers. 

N.  R.  Garner  and  R.  W.  Vail,  Jr.  [13]  have  proposed  another  method  of 

combining  component  data  to  obtain  a lower  confidence  limit.  Let  N=  min  N. 

* k * l<ifk 

let  y = Y,  (N  -x  ),  that  is,  y is  the  total  number  of  failures  observed. 

i=1_  * 

Further  let  y = max  (N  -x  } . Then  if  y < N,  the  lower  confidence  limit  is 
1<  i<k 

given  by 


- j /iv  - y 

Lh  = E * L(“*(S5)  b(z?N,h(p))/ Yj  * b(w;N, h(p)) 
z = N-y  / w=N-y 


where  b(x;M,p)  is  the  binomial  probability  function  with  parameters  M and 
(N) 

p and  L (x)  is  the  lower  confidence  limit  for  the  binomial  parameter  when  x 
successes  are  observed  in  N Bernoulli  trials.  It  can  easily  be  seen  that  this 
produces  optimistic  (non-conservative)  answers  in  some  cases. 

D.  L.  Lindstrom  and  J.  H.  Madden,  (see  D.  K.  Lloyd  and  M.  Lipow 
[19])  have  proposed  the  following  orocedure.  Interpret  Nh(p)  as  the  number 
of  successes  in  N Bernoulli  trials.  That  is,  Nh(£)  is  interpreted  as  the 


-8- 


observation  from  a binomial  distribution  with  parameters  (N,  h(p))  and 

this  is  employed  to  obtain  a lower  confidence  limit  Lh  for  h(p)  . 

R.  J.  Buehler  flO]  suggested  a technique  for  obtaining  ••optimal-  one- 
sided confidence  intervals  for  the  product  of  binomial  parameters.  His  tech- 
nique, however,  requires  that  the  possible  observed  k-tuples  be  ordered. 
Thus,  if  k = 2,  there  are  (N}  + 1)(N2  +1)  sample  points  that  must  be  ordered 
in  a sequence.  Specifically,  let  c (Xj,  xj  satisfy 


(10) 


P^°N  , N (X1’  X2)  - P1P2  - ^ - 1 “ a • 

X u 


Tnen  the  numbers  cN  (x  x ) form  a set  of  lower  confidence  limits 

1»  2 

for  p^p^  . These  are  to  be  ordered  so  that 


(11) 


CNrN2(Xl 


(J) 


< c 


«<*)  x<‘>, 

N,,  N2|X1  >x2  > 


whenever  i < j,  where  1 < i < j < (N,  ♦ 1)(N.,  +1)  . The  unique  sequence  sucn 
that  the  c M <x,-x2)  are  uniformly  largest  subject  to  the  specific  ordering 

X c 

chosen  is  given  by 


(12) 


3NrN2(Xi  >»x2i>)  = inf{pip2:  *XNj,  PH  xJJ))  b{N2,  P2sx^)  > a}  f 


where  the  index  j is  determined  by  the  ordering.  Since  (12)  depends  on  the 
choice  of  ordering,  the  question  of  how  a particular  ordering  should  be  chosen 
is  by  no  means  clear.  Buehler  has  suggested  the  ordering  based  on  the  criterion 


implies 


cn/’V 


•TVS) 


:N^{X2>^) 


\ N2|X1-  V 5 % K2 


(XJ» 


*2> 


where  c {x,  4i-a  ) denotes  the  lower  confidence  limit  for  a single  binomial 

W1  ' 

parameter  with  confidence  coefficient  *Jl-a  . 

Under  suitable  conditions  the  binomial  distribution  may  be  replaced 
by  the  Poisson  distribution,  which  offers  some  computational  advantages. 

In  general,  Buehler's  method  is  difficult  to  utilize,  particularly  for 
k > 2,  because  of  the  computational  effort  involved  in  ordering  the  sample 
outcomes. 

J.  L.  Epstein  [12]  considered  the  case  k = 2 and  compared  the 

estimators  hj  = (^  + 1)(X2  + 1)/*^  and  h?  = X^/NjN.,.  Let 

Ti  = {(x^  x2;:N1N2h1  = i}  and  S.  = {(x^  x2):  = j}, 

i < i < (Nj  +1)(N2  +1),  0<j<N^N2.  Then  the  sets  and  determine 

partitions  of  the  sample  space  which  we  denote  by  Z and  S respectively. 

The  sets  T.  and  S are  ordered  so  that  a monotonicity  condition  is 
1 3 k k 

satisfied,  that  is  if  (x.,x)«  U S then  (y.,y  )«  US  for  all 

i=l  1 Z i=l  * 

y2<  x2.  Epstein  computes  some  numerical  examples  of  confidence 
limits  and  it  appears  from  his  investigation  that  the  partition  Z possesses 
some  advantages  over  the  partition  8. 

In  an  unsigned  and  undated  mimeographed  report  which  has  come  to 
my  attention  (listed  in  references  as  Anonymous  [1]),  the  following  technique 
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has  been  suggested.  Let  = X./N^  i = 1,  2, . . . , k and  consider 

k k 

log  TT  P.  --  log  h(p)  = V log  p . 
i=l  i=l  1 

k 

Then  since  ^ log  p(  is  a sum  of  independent  random  variables,  we  assume 
i=l 

that  the  central  limit  theorem  can  be  employed  and  use  a normal  approximation 

to  obtain  the  lower  confidence  limit  L.  . Note  that  the  mean  and  variance 

-h 

of  log  pi  do  not  exist.  The  author  suggests  using  the  "relationship  between 
the  moments  of  the  normal  and  the  moments  of  the  lognormal  distributions  to 
determine  the  moments  of  log  p^". 

One  of  the  techniques  in  the  "folklore"  of  reliability  theory  has  been 
the  use  of  Poisson  approximations  when  the  reliability  of  each  component  can 
be  assumed  to  be  "high".  Intuitively, this  technique  can  be  sketched  as 
follows.  Write 

k 

1 - h(p ) = 1 - TT  {1  - q,),  q = 1 - p , i = 1,  2,  . . . , k . 

i=I 

Then  set 

qi  " X/Ni’  i = 1,  2, . . . , k . 

Accordingly,  we  can  write 

k k 

1 - h(p)  1 - Tf  (1  - X./N  ) - V X./N  . 
i=l  i=l 


Thus,  if  we  let  1 = 1,2,  ...,k,  we  can  regard  the  Y.’s  as 

being  independently  Poisson  distributed  random  variables  with  parameters  X . 

i 

This  idea  has  been  extensively  exploited  'n  the  statistical  literature  and  we 


provide  three  illustrations. 


i 


In  the  book  by  B.  V.  Gnedenko,  Yu.  K.  Belyayev  and  A.  D.  Solov'yev 
[14],  the  following  method  is  suggested,  which  the  authors  attribute  to  R.  A. 
Mirniy  [28]. 

k _ k 

Let  <p  ~ ~Y  log  p and  let  <p  = max(-V  log  p ) subject  to  the  con- 
1=1  k i=l 

ditions  > 0,  i = 1,  2, . . . , k and  £ < A . . . , Yk)  , 

where  P{;  \.  < A } > l~a ; a.  is  the  solution  in  A of  P {Z< 

k i=l  1_  Ua  • Ua  A “ 

y Y } = a and  Z has  the  Poisson  distribution  with  parameter  A. 
i=l 

Then  they  obtain 


L.  = e 


where 


” =A1-„(Y1’V2 • 


I.  V.  Pavlov  [34]  provides  a comprehensive  discussion  of  Poisson 
methods  for  series  reliability.  L.  N.  Bol'shev  and  E.  A.  Loginov  [8]  use  the 
Poisson  approximation  and  determine  the  lower  confidence  limit  by 


L = 1 . -i-  v2 

-h  v« 


(l-«)  , 


2 

where  X2y*+2  denotes  the  (l-«)  point  of  the  chi-square  distribution  with 

* 

2y  +2  degrees  of  freedom.  • 

The  evidence  accumulated  thus  far  suggests  that  these  are  likely  to 
produce  conservative  results. 

A conservative  method  in  keeping  with  the  above-mentioned  "folk 
theorem"  of  reliability  theory  is  described  in  the  paper  by  A.  Winterbottom  [41]. 


•■*•*!***)&»*, ittitliii, n,  . 


A method  known  as  the  key  test  results  method  was  introduced  by 


K.  A.  Weaver  [39]  and  described  oy  him  for  the  case  = N2  = • • • = = N 

and  extended  to  arbitrary  N,,  l<i<k  by  A.  Winterbottom  [40],  [41]. 

Specifically,  define 

k 

(16)  G(h(p);  p)  = P~  {hJT^/Nj))  > h{p)}  , 

that  is,  the  probability  of  observing  an  estimator  of  the  form  (8)  greater  than 
or  equal  to  the  observed  value  h(p)  . Then 

(17)  { P s G(h(p);  p)}  > a 

constitutes  the  set  of  parameter  points  that  would  be  accepted  in  a significance 

<w  cv 

test  of  size  a . The  lower  confidence  limit  is  given  by  inf{h(p):  G(h(p),  P)^®}  • 
Winterbottom  gives  a graphical  method  for  solving  (17)  when  k = 2 and 
comments  on  the  difficulties  encountered  when  k > 2 . 

With  no  loss  of  generality,  we  can  let  N = < . . . < . Then 

the  key  test  results  are  those  with  0 < x,  = i = 2,  . . . , k . Using 

the  key  test  results,  Winterbottom  shows  that 

Ni 

(18)  Lh  = !nf{p:  £ b(z;N  P ) > a)  , 

Z = Xj 

where,  if  h(p)  = u,  interpret  the  experimental  outcome  as  x^  = Nu  and 
compute  the  lower  confidence  limit  from  (18). 

We  conclude  this  section  by  briefly  mentioning  the  work  of  C.  Mack. 

In  a sequence  of  four  papers,  he  treated  various  problems  concerned  with 


V 


determining  confidence  limits  for  products  of  binomial  parameters.  In  the 
first  of  these  [20],  he  provides  a number  of  technical  improvements  to  the 
likelihood  ratio  technique  proposed  by  Madansky  [24].  The  third  [22]  treats 
the  case  of  the  product  of  two  binomial  parameters  (k  = 2)  . The  technique 
is  basically  a computational  method,  which  can  be  used  when  the  number  of 
failures  is  small.  Tables  of  formulas  and  numerical  values  are  given  to 
facilitate  the  computation. 

5.  Methods  for  statistical  inference  for  parallel  systems.  For  parallel  systems, 

we  write  1 - h(p  ) = | |q^  . Thus,  there  is  a natural  duality  between  parallel 

and  series  systems  and  many  statistical  techniques  for  series  systems  may 

be  transformed  into  equivalent  methods  for  parallel  systems  upon  replacing 

Xj  by  N.  -Xj,  p.  by  q^  and  conversely.  Then  lower  confidence  limits  for 
k k 

TTp,  become  upper  confidence  limits  for  TT  q.  . This  duality  has  been  exploited 
i=l  i=i  1 

in  the  papers  by  R.  J.  Buehler  [10],  J.  L,  Epstein  [12],  Anonymous  [1]  and  C.  Mack  [20]. 

Note  that  in  the  case  of  the  suggestion  by  Anonymous  [1],  if  no  failures 
have  been  observed  for  any  component,  then  the  data  cannot  be  used.  This  is 
somewhat  contradictory  in  that  one  should  feel  happy  that  no  failures  are 
observed,  yet  this  prevents  utilization  of  the  experimental  data. 

We  now  consider  methods  specifically  designed  for  parallel  systems. 

B.  Harris  [15]  assumed  that  Yj,  Y 2,  . . . , Y^  are  independently  Poisson 
distributed  with  parameters  X,,  . . . , X^,  where  Y^,  . . . , Y^  are  to 

be  regarded  as  the  number  of  observed  failures  for  each  of  the  components. 

Further  Xj  is  to  be  regarded  as  N^,  i = 1, 2, . . . , k . Then,  if 
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k k 

(19)  9 = TT  X = TT  (q./N  ) , 

i=l  i=l 

the  conditional  distribution  of  given  ^-Y^,  . . . , Y^-Y^  depends  only  on 
9 and  not  on  the  individual  X^'s  . This  observation  may  be  employed  to 
obtain  uniformly  most  powerful  similar  tests  and  uniformly  most  accurate 
unbiased  confidence  intervals  for  0 . The  contitional  distribution  referred 
to  above  is  known  as  the  generalized  incomplete  modified  Bessel  distribution 
and  is  tabulated  in  B.  Harris  and  A.  P.  Soms  [16]. 

In  the  remaining  two  papers  by  C.  Mack  [21],  [23],  methods  for  use 
with  parallel  systems  are  described.  Both  are  restricted  to  the  case  k = 2 , 
the  first  giving  appropriate  methods  for  use  when  the  number  of  observed 
failures  is  small.  The  second  of  these  is  largely  devoted  to  tables  to  facilitate 
computation  of  the  upper  confidence  limit  for  the  probability  of  failure  of  the 
system. 

We  digress  briefly  to  comment  on  the  point  estimation  of  h(p)  . The 

minimum  variance  unbiased  estimator  and  maximum  likelihood  estimator  of 
^ k k 

1 - h(p)  = TT  q.  is  1 - h(p)  = TT  (N  -X  )/N  , that  is,  the  product  of  the 
i=l  1 i=l  1 1 1 

observed  proportion  of  failures  of  the  individual  components.  However,  if 
any  component  exhibits  no  failures  in  the  experiment,  then  h(p)  = 1 . Thus 
for  data  like  Nj  = lOC^  = 75,  N_  = 100,  X2  = 70,  N?  = 5,  X3  = 5,  we  have 

A 

h(p)  = 1,  which  is  inherently  a poor  estimator. 

6.  Methods  which  may  be  used  for  arbitrary  coherent  systems.  A.  Mad  an  sky 
[24]  treated  the  use  of  Wllk's  likelihood  ratio  statistic  for  the  construction  of 
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confidence  sets  for  the  reliability  function  h(p  ) of  series,  parallel,  and 
series -parallel  systems.  He  described  the  use  of  Lagrange  multipliers  to 


i 


accomplish  the  necessary  constrained  maximization.  He  also  described 
what  he  called  the  linearization  method,  that  is,  expanding  the  maximum 
likelihood  estimator  h(p)  up  to  terms  of  first  degree  and  thus  obtaining  the 
statistic 

(20)  Z = (h(p)  - h(p))/Sn(0)  , 

A A 

where  0)  is  an  estimator  of  the  asymptotic  standard  derivation  of  h(p)  . 
According  to  the  usual  large  sample  theory,  Z is  asymptotically  normally 
distributed  with  zero  mean  and  unit  variance. 

This  line  of  investigation  was  continued  by  J.  M.  Myhre  and  S.  C. 
Saunders  [30],  who  compared  the  likelihood  ratio  with  the  linearization  method 
and  concluded  that  the  likelihood  ratio  method  was  the  more  accurate  of  the 
two  methods. 

Myhre  and  Saunders  [31]  subsequently  extended  the  work  of  Madansky 
on  the  likelihood  ratio  statistic  to  arbitrary  coherent  systems. 

R.  G.  Easterling  [11]  proposed  a modification  of  the  linearization  method 

2 A 

in  that  the  variance  estimate  0)  of  (20)  is  replaced  by 

(21)  a 2 = h(p)  (1  -h(p))/n  , 

where  n is  called  the  pseudo-sample  size.  The  estimate  n is  obtained  by 

A 2 

equating  <r  to  the  estimate  of  the  asymptotic  variance  of  the  maximum 
likelihood  estimate.  The  data  is  then  to  be  regarded  as  a sample  from  a 
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binomial  distribution  with  x = n h(p)  successes  and  n observations  and 


this  is  used  to  obtain  the  lower  confidence  limit  L,  . 

-h 

These  methods  all  have  the  serious  drawback  that  they  perform  very 
badly  when  the  reliability  is  high,  that  is,  near  unity.  Yet  I feel  that  this  is 
the  region  of  the  parameter  space  which  is  of  greatest  interest  in  applications. 

A substantially  different  technique  was  proposed  by  Joan  R.  Rosenblatt 
[35].  Given  the  sample  sizes  N , N , . . . , N and  the  data  X = x X = x , 

i K 11  C*  C* 

. . . , X^  = x^,  the  author  discussed  simulation  methods  of  statistical  infer- 

ence  for  h(p)  . One  method  of  simulation  is  to  select  possible  systems  by 

sampling  without  replacement  from  each  component.  Then  N systems  are 

constructed  and  thus  N values  of  <p(x)  are  obtained.  Then  the  average 
N ~ 

Yj  <p(x  )/N  is  an  unbiased  estimator  of  h(p)  and  by  the  central  limit 
J = 1 J 


theorem,  this  statistic  has  an  asymptotically  normal  distribution,  permitting 

statistical  inference  to  be  easily  carried  out. 

The  author  notes  properly  that  this  is  an  inefficient  use  of  the  available 

k 

data.  With  the  same  data  it  is  possible  to  construct  N = TT  N systems, 

i=l 

obtaining  thereby  N values  of  <p(x)  . The  average  of  these  is  an  unbiased 
estimator  of  h(p)  . Further,  she  shows  that  this  is  a U-statistic  in  the 
sense  of  Hoeffding  and  that  ^ N (U  - h(p))  is  asymptotically  normally  dis- 
tributed whenever  N^/N  -*■  c^,  i = 1,  2,  . . . , k . Also,  it  is  shown  that 
(j-U  < — ^h(p)(l  -h(p)^.  Thus  an  asymptotic  lower  confidence  limit  for  h(p) 
can  easily  be  obtained.  In  general,  this  will  be  a conservative  procedure. 

7.  Bayesian  methods.  There  is  an  extensive  literature  on  the  use  of  Bayesian 
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methods  in  reliability  analysis.  In  this  section  we  provide  a brief  description 
of  some  of  the  literature  in  this  area. 

M.  D.  Springer  and  W,  E.  Thompson  [36],  [37]  obtain  the  Bayesian 
solution  for  series  and  parallel  systems  by  employing  beta  priors  for  each 
component.  They  employ  the  Mellin  transform  technique  to  combine  the 
posterior  distributions  for  the  individual  components  and  this  is  inverted  to 
obtain  the  posterior  distribution  of  h(p  ) . The  resulting  formula  is  fairly 
intractable  and  the  authors  suggest  the  use  of  digital  computers  to  evaluate 
this  and  thus  to  obtain  confidence  limits.  An  elementary  discussion  of 
Bayesian  methods  for  series  reliability  using  Beta  priors  for  the  individual 
components  may  be  found  in  A.  M.  Breipohl,  R.  R.  Prairie  and  W.  J.  Zimmer 
[9]- 

In  W.  E.  Thompson  and  E.  Y.  Chang  [33]  the  use  of  Chebyshev  poly- 
nomials is  suggested  as  a numerical  method  for  evaluating  the  posterior  dis- 
tribution of  the  system  reliability.  The  paper  was  written  for  the  purpose 
of  providing  a Bayesian  method  for  the  reliability  of  parallel  system  when 
each  component  has  an  exponential  lifetime  distribution.  However,  the  same 
technique  can  be  employed  in  precisely  the  same  manner  v/henever  the  posterior 
distribution  of  the  reliability  of  the  individual  components  has  the  form 

(p  +l)ai+1  (3  or 

(22)  f(pi)  = T^I+TT  Pi  ln(I/pi)  * ari  > 1»  Pj  > “1»  i = 1,  2,  . . . , k . 
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J.  B.  Parker  [33]  pointed  out  that  one  obtain?  different  results  if  one 
chooses  an  uninformative  prior  for  the  system  reliability  than  if  one  chooses 
the  same  uninformative  prior  for  the  reliability  of  each  individual  component.  For 
example,  a uniform  prior  might  be  an  illustration  of  such  a prior.  Parker 
presents  a table  comparing  component  priors  and  system  priors. 

The  use  of  Monte  Carlo  methods  in  conjunction  with  a Bayesian  analysis 
is  described  in  A.  Hinojosa,  D.  V.  Mastran  and  D.  Spreen  [17]  and  in 
D.  V.  Mastran  [27].  A prior  distribution  is  assumed  for  the  reliability  of 
each  individual  component.  Then  the  experimental  data  is  used  to  determine 
the  posterior  distribution.  In  [17]  uniform  priors  are  used  and  in  [27]  Beta 
priors  are  assumed.  Then  an  estimate  of  h(p)  is  obtained  by  simulation, 
that  is,  a random  reliability  is  chosen  for  each  component  by  sampling  from 
the  respective  posterior  distributions.  Repeating  this,  an  estimate  of  the 
posterior  distribution  of  the  system  reliability  is  obtained.  A beta  distribution 
is  fitted  to  this  empirical  posterior  by  the  method  of  moments  and  the  lower 
confidence  bound  can  then  be  determined. 

In  [17]  the  following  problem  is  also  discussed.  Consider  a series 
system  with  k = 2 and  let  = N2  and  assume  that  no  failures  have  been 
observed.  This  can  also  be  regarded  as  N trials  of  the  complete  system 
with  no  observed  failures.  The  authors  suggest  that  the  second  interpretation 
would  give  better  reliability  estimates,  however,  no  theoretical  justification 
for  this  remark  is  provided  in  the  paper. 

D.  A.  Berry  [5]  considered  sampling  plans  which  minimize  the  Bayes 


risk,  taking  the  sampling  costs  into  account.  Specifically  he  considers  the 
loss  function 

~ a , k 

(2  3)  Lfn(p),  h(p),  Nj, . . Nk)  = (h(p)  - h(p))"1  + £ N , 

where  c(  > 0 is  the  cost  of  a single  observation  on  the  ith  component. 

The  specific,  details  are  carried  out  for  series  and  parallel  systems  with 

k --  2 and  uniform  priors  for  the  reliabilities  of  the  individual  components. 

mi 

This  is  extended  to  k components,  k > 2 and  priors  of  the  form  p 

In  N.  R.  Mann  and  F.  E.  Grubbs  [2  5],  the  posterior  distribution  of 
-In  h(p)  is  approximated  by  a chi-square  distribution  to  obtain  lower  con- 
fidence limits  for  series  and  parallel  reliability.  While  computationally 
straightforward,  the  procedure  is  tedious  to  describe  and  the  details  are 
omitted.  The  sa-'e  procedure  is  also  described  in  the  book  by  N.  R.  Mann, 

R.  Schafer,  and  N.  Singpurwalla  [26].  The  authors  claim  that  their  method 
provides  excellent  agreement  with  procedures  known  to  possess  some  opti- 
mality property,  such  as  Harris  [15]. 

8.  Decision  theoretic  models.  To  date,  I have  no  evidence  that  decision 

theoretic  techniques  have  been  employed  with  the  exception  of  Bayesian 

decision  methods.  Here,  the  only  loss  function  that  has  been  utilized  is 

L(h(p),  h(p))  = (h(p)  - h(p))  (see  [5],  [9]).  I regard  this  as  an  inappropriate 

loss  function.  If  the  system  reliability  is  . 99,  then  on  the  average  it  should 

faii  one  tnne  in  100,  whereas  if  the  system  reliability  is  -.  999,  it  should  fail 

about  one  time  in  1000  and  hence  is  ten  times  as  good.  Thus,  the  loss  func- 

— -1 

tion  should  depend  on  how  well  one  estimates  (i-h(p))  . Possibilities  for 


this  are  loss  functions  of  the  form 


L(h(p),  h(p))  = Id-h(p))'1  - (l-h^))'1!",  a > 0 . 

It  appears  to  me  that  this  is  a direction  well  worth  considering. 

9.  Concluding  remarks.  Of  necessity,  this  survey  is  far  f.om  complete. 

There  are  several  reasons  for  this.  Much  of  the  literature  of  reliability  theory 
is  contained  in  proprietary  publications  and  is  not  in  the  usual  open  literature. 
Hence,  many  articles  could  not  be  obtained  easily.  Similar  inaccessibility 
applies  to  papers  appearing  in  a number  of  foreign  journals  which  could  not 
be  easily  obtained.  In  addition,  some  readily  accessible  articles  have  surely 
been  inadvertently  overlooked. 

Finally,  it  is  clear  that  there  are  an  enormous  number  of  methods  avail- 
able and,  to  the  best  of  my  knowledge,  there  has  been  no  definitive  work  to 
date  which  provides  a reliable  and  comprehensive  comparison  between  the  different 
techniques.  Many  papers  compare  numerical  values  for  the  lower  confidence 
limit  for  hjp)  for  several  techniques,  but  this  is  an  incomplete  comparison, 
since  the  true  confidence  coefficient  has  only  been  approximated  by  the 
stated  confidence  coefficient,  and  the  former  is  in  general  unknown. 
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